side effect
Embeddings as Probabilistic Equivalence in Logic Programs
The integration of logic programs with embedding models resulted in a class of neurosymbolic frameworks that jointly learn symbolic rules and representations for the symbols in the logic (constant or predicate). The key idea that enabled this integration was the differentiable relaxation of unification, the algorithm for variable instantiation during inference in logic programs. Unlike unification, its relaxed counterpart exploits the similarity between symbols in the embedding space to decide when two symbols are semantically equivalent. We show that this similarity between symbols violates the transitive law of equivalence, leading to undesirable side effects in learning and inference. To alleviate those side effects, we are the first to revamp the well-known possible world semantics of probabilistic logic programs into new semantics called equivalence semantics. In our semantics, a probabilistic logic program induces a probability distribution over all possible equivalence relations between symbols, instead of a probability distribution over all possible subsets of probabilistic facts. We propose a factorization of the equivalence distribution using latent random variables and characterize its expressivity. Additionally, we propose both exact and approximate techniques for reasoning in our semantics. Experiments on well-known benchmarks show that the equivalence semantics leads to neurosymbolic models with up to 42% higher results than state-of-the-art baselines.
New cancer tech sends chemo straight to tumors
This material may not be published, broadcast, rewritten, or redistributed. Quotes displayed in real-time or delayed by at least 15 minutes. Market data provided by Factset . Powered and implemented by FactSet Digital Solutions . Mutual Fund and ETF data provided by LSEG . Apple's $250M Siri settlement: Are you owed cash?
China Approves the First Brain Chips for Sale--and Has a Plan to Dominate the Industry
While the United States and Europe are moving cautiously forward with clinical trials, China is racing toward the commercialization of brain implants. China has made history by becoming the first nation to approve a commercially available brain chip to treat a disability. NEO, the implant developed by Neuracle Medical Technology, translates the thoughts of a person with paralysis into movements of an assistive robotic hand. After 18 months of testing that proved its safety, China's National Medical Products Administration authorized the implant for people aged 19 to 60 with paralysis caused by neck or spinal cord injuries that prevent them from moving their limbs. According Nature, the implant embedded in the skull is about the size of a coin.
FDA Approves Pill Version of Wegovy
Novo Nordisk's semaglutide will soon be available in a daily pill Americans can take for weight loss. The US Food and Drug Administration today approved a pill version of the blockbuster anti-obesity drug Wegovy. Made by Novo Nordisk, the pill is taken once a day. The company's original version of Wegovy is a weekly injection. Both drugs contain the same active ingredient, semaglutide.
Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability
Raimondi, Bianca, Dalbagno, Daniela, Gabbrielli, Maurizio
Large language models (LLMs) have been shown to internalize human-like biases during finetuning, yet the mechanisms by which these biases manifest remain unclear. In this work, we investigated whether the well-known Knobe effect, a moral bias in intentionality judgements, emerges in finetuned LLMs and whether it can be traced back to specific components of the model. We conducted a Layer-Patching analysis across 3 open-weights LLMs and demonstrated that the bias is not only learned during finetuning but also localized in a specific set of layers. Surprisingly, we found that patching activations from the corresponding pretrained model into just a few critical layers is sufficient to eliminate the effect. Our findings offer new evidence that social biases in LLMs can be interpreted, localized, and mitigated through targeted interventions, without the need for model retraining.